智能论文笔记

Algorithm-Agnostic Interpretations for Clustering

Christian A. Scholbeck , Henri Funk , Giuseppe Casalicchio

分类：机器学习 | (统计)机器学习

2022-09-21

通常通过后处理，涉及降低和后续可视化来解释高维数据的聚类结果。这破坏了数据的含义并混淆了解释。我们提出了算法 - 敏捷的解释方法，以在缩小尺寸中解释聚类结果，同时保留数据的完整性。集群的置换特征重要性代表基于改组特征值并通过自定义分数功能衡量群集分配的变化的一般框架。集群的个体条件期望表明由于数据的变化而导致群集分配的观察变化。聚类的部分依赖性评估整个特征空间的群集分配的平均变化。所有方法都可以与能够通过软标签重新分配实例的任何聚类算法一起使用。与常见的后处理方法（例如主组件分析）相反，引入的方法保持了特征的原始结构。

translated by 谷歌翻译

Enhancing Explainability of Hyperparameter Optimization via Bayesian Algorithm Execution

Julia Moosbauer , Giuseppe Casalicchio , Marius Lindauer , Bernd Bischl

分类：机器学习 | (统计)机器学习

2022-06-11

尽管自动超参数优化（HPO）的所有好处，但大多数现代的HPO算法本身都是黑盒子。这使得很难理解导致所选配置，减少对HPO的信任，从而阻碍其广泛采用的决策过程。在这里，我们研究了HPO与可解释的机器学习（IML）方法（例如部分依赖图）的组合。但是，如果将这种方法天真地应用于HPO过程的实验数据，则优化器的潜在采样偏差会扭曲解释。我们提出了一种修改的HPO方法，该方法有效地平衡了对全局最佳W.R.T.的搜索。预测性能以及通过耦合贝叶斯优化和贝叶斯算法执行的基础黑框函数的IML解释的可靠估计。在神经网络的合成目标和HPO的基准情况下，我们证明我们的方法返回对基础黑盒的更可靠的解释，而不会损失优化性能。

translated by 谷歌翻译

Explaining Hyperparameter Optimization via Partial Dependence Plots

Julia Moosbauer , Julia Herbinger , Giuseppe Casalicchio , Marius Lindauer , Bernd Bischl

分类：机器学习 | (统计)机器学习

2021-11-08

自动化的HyperParameter优化（HPO）可以支持从业者在机器学习模型中获得峰值性能。然而，通常缺乏有价值的见解，以对不同的超参数对最终模型性能的影响。这种缺乏可解释性使得难以信任并理解自动化的HPO过程及其结果。我们建议使用可解释的机器学习（IML）从HPO中获得的实验数据与贝叶斯优化（BO）一起获得见解。 BO倾向于专注于具有潜在高性能配置的有前途的区域，从而诱导采样偏差。因此，许多IML技术，例如部分依赖曲线（PDP），承载产生偏置解释的风险。通过利用BO代理模型的后部不确定性，我们引入了具有估计置信带的PDP的变种。我们建议分区Quand参数空间以获得相关子区域的更自信和可靠的PDP。在一个实验研究中，我们为子区域内PDP的质量提高提供了定量证据。

translated by 谷歌翻译

OpenML Benchmarking Suites

Bernd Bischl , Giuseppe Casalicchio , Matthias Feurer , Pieter Gijsbers , Frank Hutter , Michel Lang , Rafael G. Mantovani , Jan N. van Rijn , Joaquin Vanschoren

分类： (统计)机器学习 | 机器学习

2017-08-11

机器学习研究取决于客观解释，可比和可重复的算法基准。我们倡导使用策划，全面套房的机器学习任务，以标准化基准的设置，执行和报告。我们通过帮助创建和利用这些基准套件的软件工具来实现这一目标。这些无缝集成到OpenML平台中，并通过Python，Java和R. OpenML基准套件（A）的接口访问，易于使用标准化的数据格式，API和客户端库; （b）附带的数据集具有广泛的元信息; （c）允许在未来的研究中共享和重复使用基准。然后，我们为分类提供了一个仔细的策划和实用的基准测试套件：OpenML策划分类基准测试套件2018（OpenML-CC18）。最后，我们讨论了使用案例和应用程序，这些案例和应用程序尤其展示了OpenML基准套件和OpenML-CC18的有用性。

translated by 谷歌翻译

A Segmentation Method for fluorescence images without a machine learning approach

Giuseppe Giacopelli , Michele Migliore , Domenico Tegolo

分类：计算机视觉 | 人工智能

2022-12-28

Background: Image analysis applications in digital pathology include various methods for segmenting regions of interest. Their identification is one of the most complex steps, and therefore of great interest for the study of robust methods that do not necessarily rely on a machine learning (ML) approach. Method: A fully automatic and optimized segmentation process for different datasets is a prerequisite for classifying and diagnosing Indirect ImmunoFluorescence (IIF) raw data. This study describes a deterministic computational neuroscience approach for identifying cells and nuclei. It is far from the conventional neural network approach, but it is equivalent to their quantitative and qualitative performance, and it is also solid to adversative noise. The method is robust, based on formally correct functions, and does not suffer from tuning on specific data sets. Results: This work demonstrates the robustness of the method against the variability of parameters, such as image size, mode, and signal-to-noise ratio. We validated the method on two datasets (Neuroblastoma and NucleusSegData) using images annotated by independent medical doctors. Conclusions: The definition of deterministic and formally correct methods, from a functional to a structural point of view, guarantees the achievement of optimized and functionally correct results. The excellent performance of our deterministic method (NeuronalAlg) to segment cells and nuclei from fluorescence images was measured with quantitative indicators and compared with those achieved by three published ML approaches.

translated by 谷歌翻译

TypeFormer: Transformers for Mobile Keystroke Biometrics

Giuseppe Stragapede , Paula Delgado-Santos , Ruben Tolosana , Ruben Vera-Rodriguez , Richard Guest , Aythami Morales

分类：计算机视觉

2022-12-26

The broad usage of mobile devices nowadays, the sensitiveness of the information contained in them, and the shortcomings of current mobile user authentication methods are calling for novel, secure, and unobtrusive solutions to verify the users' identity. In this article, we propose TypeFormer, a novel Transformer architecture to model free-text keystroke dynamics performed on mobile devices for the purpose of user authentication. The proposed model consists in Temporal and Channel Modules enclosing two Long Short-Term Memory (LSTM) recurrent layers, Gaussian Range Encoding (GRE), a multi-head Self-Attention mechanism, and a Block-Recurrent structure. Experimenting on one of the largest public databases to date, the Aalto mobile keystroke database, TypeFormer outperforms current state-of-the-art systems achieving Equal Error Rate (EER) values of 3.25% using only 5 enrolment sessions of 50 keystrokes each. In such way, we contribute to reducing the traditional performance gap of the challenging mobile free-text scenario with respect to its desktop and fixed-text counterparts. Additionally, we analyse the behaviour of the model with different experimental configurations such as the length of the keystroke sequences and the amount of enrolment sessions, showing margin for improvement with more enrolment data. Finally, a cross-database evaluation is carried out, demonstrating the robustness of the features extracted by TypeFormer in comparison with existing approaches.

translated by 谷歌翻译

The URW-KG: a Resource for Tackling the Underrepresentation of non-Western Writers

Marco Antonio Stranisci , Giuseppe Spillo , Cataldo Musto , Viviana Patti , Rossana Damiano

分类：自然语言处理

2022-12-21

Digital media have enabled the access to unprecedented literary knowledge. Authors, readers, and scholars are now able to discover and share an increasing amount of information about books and their authors. Notwithstanding, digital archives are still unbalanced: writers from non-Western countries are less represented, and such a condition leads to the perpetration of old forms of discrimination. In this paper, we present the Under-Represented Writers Knowledge Graph (URW-KG), a resource designed to explore and possibly amend this lack of representation by gathering and mapping information about works and authors from Wikidata and three other sources: Open Library, Goodreads, and Google Books. The experiments based on KG embeddings showed that the integrated information encoded in the graph allows scholars and users to be more easily exposed to non-Western literary works and authors with respect to Wikidata alone. This opens to the development of fairer and effective tools for author discovery and exploration.

translated by 谷歌翻译

Attend to the Right Context: A Plug-and-Play Module for Content-Controllable Summarization

Wen Xiao , Lesly Miculicich , Yang Liu , Pengcheng He , Giuseppe Carenini

分类：自然语言处理

2022-12-21

Content-Controllable Summarization generates summaries focused on the given controlling signals. Due to the lack of large-scale training corpora for the task, we propose a plug-and-play module RelAttn to adapt any general summarizers to the content-controllable summarization task. RelAttn first identifies the relevant content in the source documents, and then makes the model attend to the right context by directly steering the attention weight. We further apply an unsupervised online adaptive parameter searching algorithm to determine the degree of control in the zero-shot setting, while such parameters are learned in the few-shot setting. By applying the module to three backbone summarization models, experiments show that our method effectively improves all the summarizers, and outperforms the prefix-based method and a widely used plug-and-play model in both zero- and few-shot settings. Tellingly, more benefit is observed in the scenarios when more control is needed.

translated by 谷歌翻译

Inductive Attention for Video Action Anticipation

Tsung-Ming Tai , Giuseppe Fiameni , Cheng-Kuang Lee , Simon See , Oswald Lanz

分类：计算机视觉

2022-12-17

Anticipating future actions based on video observations is an important task in video understanding, which would be useful for some precautionary systems that require response time to react before an event occurs. Since the input in action anticipation is only pre-action frames, models do not have enough information about the target action; moreover, similar pre-action frames may lead to different futures. Consequently, any solution using existing action recognition models can only be suboptimal. Recently, researchers have proposed using a longer video context to remedy the insufficient information in pre-action intervals, as well as the self-attention to query past relevant moments to address the anticipation problem. However, the indirect use of video input features as the query might be inefficient, as it only serves as the proxy to the anticipation goal. To this end, we propose an inductive attention model, which transparently uses prior prediction as the query to derive the anticipation result by induction from past experience. Our method naturally considers the uncertainty of multiple futures via the many-to-many association. On the large-scale egocentric video datasets, our model not only shows consistently better performance than state of the art using the same backbone, and is competitive to the methods that employ a stronger backbone, but also superior efficiency in less model parameters.

translated by 谷歌翻译

Understanding Online Migration Decisions Following the Banning of Radical Communities

Giuseppe Russo , Manoel Horta Ribeiro , Giona Casiraghi , Luca Verginer

分类：自然语言处理

2022-12-09

The proliferation of radical online communities and their violent offshoots has sparked great societal concern. However, the current practice of banning such communities from mainstream platforms has unintended consequences: (I) the further radicalization of their members in fringe platforms where they migrate; and (ii) the spillover of harmful content from fringe back onto mainstream platforms. Here, in a large observational study on two banned subreddits, r/The\_Donald and r/fatpeoplehate, we examine how factors associated with the RECRO radicalization framework relate to users' migration decisions. Specifically, we quantify how these factors affect users' decisions to post on fringe platforms and, for those who do, whether they continue posting on the mainstream platform. Our results show that individual-level factors, those relating to the behavior of users, are associated with the decision to post on the fringe platform. Whereas social-level factors, users' connection with the radical community, only affect the propensity to be coactive on both platforms. Overall, our findings pave the way for evidence-based moderation policies, as the decisions to migrate and remain coactive amplify unintended consequences of community bans.

translated by 谷歌翻译